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A  NUMERICALLY  EFFICIENT  DIGITAL 
MATCHED  FILTER  FOR  PERIODIC  AND 
WINDOWED  PERIODIC  RADAR  WAVEFORMS 


I.  INTRODUCTION 

A  network  whose  frequency  response  function  maximizes  the  output  peak  signal-to-mean  noise 
power  (S/N)  ratio  is  called  a  matched  filter.  Almost  all  radar  receivers  are  designed  with  the  matched 
filter  criteria.  If  hit)  is  the  impulse  response  function  of  the  matched  radar  receiver,  s (t )  is  the 
transmitted  radar  waveform,  and  the  noise  interference  is  white  and  additive,  then  it  can  be  shown 
[1-3]  that 


h(t)  =s*i~t),  (1) 

where  *  denotes  the  complex  conjugate  operation. 

A  digital  matched  receiver  design  is  based  on  the  same  principle  of  maximizing  (S/N).  We  sam¬ 
ple  the  transmitted  radar  waveform  at  equal  time  intervals  7.  Let  su  s2 . sN  be  the  values  of  the 

sampled  transmitted  waveform  where  N  is  the  number  of  sampled  points  as  seen  in  Fig.  1.  We  set 

s  =  (Sj,  s2,  ■ . .  ,  sn)t  ,  (2) 

where  T  denotes  the  vector  transpose  operation.  The  digital  receiver  applies  a  weighting  vector 
w  =  (wj,  w2,  ...  ,  wn)t  such  that 


y  =  wTs. 


(3) 
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It  can  be  shown  that  (S/N)  is  maximized  when 

w  =  s*  .  (4) 

The  matched  receiver  structure  can  be  represented  as  seen  in  Fig.  2.  Here  x(n)  is  the  received  time- 
sampled  input  sequence  consisting  of  signal  plus  noise.  The  received  input  sequence  is  digitally 
convolved  with  the  conjugated  time-reversed  sequence  of  the  vector  s,  denoted  by  §*,  which  results  in 
a  matched  output  sequence  y(n).  Mathematically,  this  can  be  stated  as 


y(n)  =  £  sk  +  ix(n  +  *)• 

k  =  0 


(5) 


x(n) 


MATCHED 

FILTER 

~  + 
s 


y(n) 


Fig.  2  —  Simplified  digital  matched  filter 


Many  surveillance  radars  employ  a  periodic  waveform  as  illustrated  in  Fig.  3.  The  waveform  is 
“on”  for  a  given  duty  cycle  and  then  turned  “off”  to  receive  the  reflected  echoes.  The  “on”  por¬ 
tion  of  the  waveform  is  subdivided  into  N  cells  where  the  n  th  cell  has  the  value  sn .  Each  cell  is  r 
seconds  long,  where  r  is  proportional  to  the  range  resolution  cell.  In  fact,  it  can  be  shown  [3]  that  if 
j8  is  the  bandwidth  of  the  radar  waveform,  then  t  ~  1/(3.  Let  there  be  M  range  resolution  cells  in 
the  pulse  repetition  interval  (PRI).  It  can  be  shown  that  PRI  =  Mr. 


M  RANGE  CELLS  PRI  =  Mr 


Fig.  3  —  Periodic  radar  waveform 


The  digitally  matched  receiver  of  this  periodic  waveform  can  be  implemented  by  using  the  linear 
convolution  operation  as  given  by  Eq.  (5)  or  it  can  be  implemented  by  using  fast  convolution  tech¬ 
niques  [4],  In  this  report,  we  introduce  a  processing  method  (based  on  circular  convolutional  tech¬ 
niques)  that  is  numerically  more  efficient  than  the  fast  convolution  techniques  when  the  duty  cycle  is 
not  small.  In  fact,  for  50%  duty  cycle  waveforms,  the  technique  uses  half  as  many  complex  multipli¬ 
cation  operations  (CMOPs)  as  do  the  fast  convolution  techniques.  The  circular  convolution  algorithm 
is  presented  in  Section  II  and  is  compared  with  a  fast  convolution  technique  in  Section  III. 

Note  that  if  the  radar  waveform  pulse  train  is  a  long-windowed  periodic  function,  then  the  circu¬ 
lar  convolution  technique  is  also  applicable  with  some  (S/N)  losses  occurring  at  the  leading  and  trail¬ 
ing  edges  of  the  received  waveform. 
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n.  MATCHED  FILTER  IMPLEMENTATION 


In  this  section,  we  present  a  matched  filter  implementation  based  on  circular  convolution  tech¬ 
niques.  Consider  the  simplified  digital  receiver  structure  shown  in  Fig.  4.  The  return  signal  is  x(t) 
and  is  sampled  each  r  seconds.  Each  sample  is  successively  stored  in  a  shift  register  (SR)  until  M 
samples  are  taken.  We  match  the  filter  to  those  M  samples.  After  matching,  the  SR  is  reloaded  with 
the  next  M  samples  of  x(t),  matched,  and  so  on.  Note  that  when  this  implementation  scheme  is 
employed,  in  most  cases  we  are  matching  the  received  data,  jc j ,  x2,  .  .  .,  across  two  PRIs  For 
example,  with  no  noise,  Xij=  sm,  x2  =  s  +i,  .  .  .,  xN_m  +  l  =  sN,  xN_m+2  =  0,  xN_m+3  =  0,  .  . 

.,  *M-m _3  0,  xM _m  _2  —  Ji,  xM _m  _ j  —  s2,  .  .  .  xM  =  sm_i,  where  m  is  related  to  referenced 

time  delay  or  compressed  range  cells. 


y 

Fig.  4  —  Simplified  digital  receiver 

We  designate  the  compressed  range  cells  as  r,,  r2,  .  .  rM.  The  matched  filter  output  for  the 
data  contained  in  the  SR  for  each  range  cell  is 

rl  :  Jl  =  Si*i+S2*2  +  ---  +*nxn 

r2  '  yi  -  s  1*2  +  ■s2r3  +  •  •  •  +  S^x N  +  ! 

r3  :  yi  1*3  +  ^2*4  +  ...+%%  +2 

.  (6) 

rM  '•  Tm  =  s  \XM  +  s  2*  1  +  ■  •  •  +  $NXN  -  1  > 
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where  ym ,  m  —  1,2 ,  ,  M  represent  the  compressed  output  for  each  range  cell. 

Let 


X  : 

=  (*1>  *2 

,  •  •  •  ,  xm)t 

(7) 

y  : 

=  (y  u  y2 

, ■ •  • ,  yiuf 

(8) 

*1 

s2 

*3  ••• 

sN  0 

.  0  " 

0 

■*1 

52  ... 

sN-l  SN 

.  0 

0 

0 

s1  ... 

SN- 2  SN- 1 

.  0 

• 

.  . . . 

. 

•  •  • 

(9) 

^2 

*3 

... 

0  0 

..  Si 

We  can  then  show  that 

y  =  S*x.  (10) 

However  5  is  a  circular  Toeplitz  matrix  [5]  and  can  be  written  in  the  form 

S  =  BAB*  ,  (11) 


where  the  Butler  matrix 


B  =  (ri?*“1)(,,_1));  m,n  =  1,2,...,  M, 


(12) 


I’m 


the  diagonal  matrix 


A  =  Ow);  m  =  1,2,...  ,M , 


(13) 


and 


"i;  sk+irft~l* 


x/M  *  =  o 


m  =  1,2, . . .  ,M . 


Hence  y  can  be  rewritten  as 


(14) 


y  =  B*A*Bx.  (15) 

As  a  result,  the  matched  filter  implementation  can  be  configured  as  shown  in  Fig.  5.  Note  that  this 
implementation  is  equivalent  to  a  circular  convolution  [4], 
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Fig.  5  —  New  matched  filter  implementation 

An  efficient  implementation  of  this  multiple  channel  matched  filter  is  accomplished  by  using  Ap¬ 
point  fast  Fourier  transforms  (FFTs)  which  is  equivalent  to  multiplying  by  the  B  matrix;  and  by  using 
M -point  inverse  FFTs,  which  is  equivalent  to  multiplying  by  B* .  This  implementation  is  shown  in 
Fig.  6.  The  input  data  structure  is  as  illustrated  in  Fig.  7,  for  N  =  8  and  M  =  16.  Note  that  the 
input  data  structure  differs  from  that  of  the  linear  convolver  in  that  the  “matching”  in  most  cases 
actually  occurs  over  two  adjacent  pulses.  In  the  example  seen  in  Fig.  7,  three  subpulses  s6,  s-j,  in 
pulse  1  are  inputted  along  with  the  five  subpulses  Sj,  s2,  s 3,  s4,  s5  in  pulse  2  into  the  matched  filter 
(circular  convolver  implementation). 


Vi  y2  yM 


Fig.  6  —  Efficient  implementation  of  new  matched  filter 
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Doppler  processing  is  also  implementable  in  conjunction  with  using  the  circular  convolver  as  a 
matched  filter  or  pulse  compressor  as  illustrated  in  Fig.  8.  Here,  we  have  inserted  a  Doppler  filter 
bank  before  pulse  compression  (note  that  ideally  the  operations  of  Doppler  filtering  and  pulse 
compression  can  be  reversed  with  no  differences  in  the  respective  output  channels).  The  Doppler 
filter  bank  samples  the  input  data  stream  x(t)  every  PRI,  which  we  set  equal  to  T.  We  make  the  fol¬ 
lowing  definitions.  Let  ,  ...  ,  ajj-1.  | ) ^  be  the  K  Doppler  coefficients  on  the  l  th 

Doppler  filter.  Let  L  be  the  number  of  Doppler  filters  and  <j>  be  the  Doppler  phase  shift  over  one 
PRI  of  our  desired  signal.  Thus  the  output  s  "  of  the  /th  Doppler  filter  related  to  the  /zth  subpulse  of 
the  received  desired  signal  is 

s'n  =  +  a[l)  sne >+  +  a$\ ne^  +  . . .  +  a$lxsn  .  (16) 


=  s„ 


K- 1 

£  aP  ejH 

k=  0 


=  Snfl(<t>), 
where  we  have  defined 


M)  =  V  al‘>e*+  (17) 

k  =0 

to  be  the  Doppler  filter  gain  factor  of  the  /th  filter.  Hence  the  output  data  stream  of  the  desired  sig¬ 
nal  through  the  /th  filter  looks  as  is  shown  in  Fig.  9.  Note  that  the  input  data  stream  for  the  desired 
signal  is  multiplied  by  the  Doppler  filter  gain  factor,  which  is  range  independent,  and  also  that  the 
basic  periodic  structure  is  retained.  Thus  this  signal  can  be  matched  filtered  by  using  the  circular 
convolution  procedure  as  if  there  were  no  Doppler  processing  present. 

III.  NUMERICAL  EFFICIENCY  COMPARISON 

The  total  number  of  complex  multiplications  operations  (CMOPs)  per  PRI  to  implement  the  cir¬ 
cular  convolution  algorithm  is  tabulated  below: 


CMOPs 


M- pt  inverse  FFT  : 

.5 M  log2  M 

X  weighting  : 

M 

M- pt  FFT  : 

,5M  log2  M 

Total  : 

M  log2  2 M 

We  compare  this  with  the  number  of  CMOPs  per  PRI  if  the  standard  matched  receiver  (Fig.  2)  is 
implemented. 


In  Fig.  2,  x(m),  m  =  0, 1,. . .  ,  are  consecutive  samples  of  x(t).  This  sequence  is  convolved 
with  the  matched  finite  impulse  response  filter,  which  has  coefficients  sx,  ...  ,sN  to  form  the  matched 
output  sequence:  y(0),  y(l),  ....  If  standard  convolution  is  employed  to  generate  a  single  output 
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PULSE  1 


PULSE  2 


x(t) 


V 


M  =  16  SAMPLES  INPUTTED  INTO  MATCHED 
FILTER  (CIRCULAR  CONVOLVER) 


Fig.  7  —  Input  data  structure  into  the  circular  convolver 


DOPPLER 
FILTER  BANK 


MATCHED 

s  =  (s1-  s2 . sn)  filter 

ae  =  (atf\  af>,  a®_1)T 


Fig.  8  —  Doppler  processing  and  matched  filtering 


PULSE  1 


S3 

S4 

S5 

S6 

S7 

S8 

_ 

_ 

— 

_ 

, _ 

,  MULTIPLIED  BY  f,(0) 


i  1 1 1  i  i  i 


PULSE  2 


x(t) 


r-  4  M  =  16  SAMPLES  INPUTTED  INTO  MATCHED 
V  FILTER  (CIRCULAR  CONVOLVER) 


Fig.  9  —  Input  data  structure  after  Doppler  processing 
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point,  N  CMOPs  are  necessary  (see  Eq.  (5)).  Thus  to  generate  M  outputs  (or  all  the  matched  outputs 
over  a  PRI),  MN  CMOPs  are  needed. 

If  fast  convolution  techniques  are  employed,  then  we  can  show  that  to  generate  N  outputs  (over¬ 
lap  and  add  technique  [4]),  then  2N  log2  4N  CMOPs  are  needed  (see  tabulation  below). 


2N  FFT  of  N-pt  input  : 

internal  multiply  by  the  DFT  {s}  : 

2 N  inverse  FFT  : 
Total  : 

Hence  to  generate  M  points, 


points  are  needed. 

We  list  the  number  of  CMOPs  per  PRI  for  each  algorithm 

Algorithm  CMOPs  per  PRI 

New  M  log2  2 M 

Fast  Convolve  M  log2  16A2 
Standard  Convolve  MN 

Thus  if  2 M  <  1 67V2,  then  the  circular  convolutional  matched  filter  is  more  efficient.  For  example,  if 
the  duty  cycle  of  the  waveform  is  50%,  then  M  =  2N .  The  new  algorithm  takes  2N  log2  AN 
CMOPs,  and  the  fast  convolve  algorithm  takes  4 N  log2  AN  CMOPs.  Hence  the  new  algorithm 
requires  half  as  many  CMOPs  as  the  fast  convolution  algorithm.  In  fact,  for  d  =  1  or  100%  duty 
cycle  waveform,  the  number  of  CMOPs  of  the  circular  convolution  algorithm  is  slightly  less  than  half 
of  the  fast  convolution  algorithm. 

If  we  define  the  duty  cycle  d  to  be  the  “on”  time  divided  by  the  PRI,  then  d  =  N/M.  Also, 
if  p  is  the  pulse  compression  ratio,  then  p  =  N.  If  BPRI  is  the  radar  waveform  bandwidth-PRI  prod¬ 
uct  that  is  equal  to  the  number  of  range  cells  per  PRI,  then 

N  =dM  —  d  BPRI .  (19) 

Using  these  above  definitions,  we  can  show  that 

no.  COMPs  using  fast  linear  convolution  4  +  2  log2  p 

- 2 -  =  -  (20) 

no.  CMOPs  using  circular  convolution  1  +  log2  BPRI  v  ’ 

In  Fig.  10,  we  plot  the  ratio  expressed  in  Eq.  (20)  as  a  function  of  the  number  of  range  cells  per  PRI, 
BPRI,  and  the  pulse  compression  ratio  p.  Note  that  if  the  ratio  given  in  Eq.  (20)  is  greater  than  one. 


CMOPs 
N  log2  2 N 

2 N 

N  log2  2 N 
2N  log2  AN ' 

M 

—  (2 N  log2  AN)  =  2 M  log2  AN  (18) 
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•  p  =  PULSE  COMPRESSION  RATIO 

•  p  s  BPRI 


0 


111! 


10 


100 


1000 


10000 


NUMBER  OF  RANGE  CELLS  PER  PRI  (BPRI) 


Fig.  10  —  CMOP’s  ratio  vs  BPRI  and  p 


then  the  circular  convolution  implementation  is  more  numerically  efficient  than  the  fast  linear  convo¬ 
lution.  Also  observe  that  the  circular  convolution  implementation  tends  to  be  more  efficient  for  larger 
pulse  compression  ratios. 

Using  the  above  formulation,  we  can  show  that  the  circular  convolution  algorithm  is  more  effi¬ 
cient  when 


(21) 


This  is  illustrated  and  plotted  in  Figs.  11  and  12.  The  region  above  the  straight  line  indicates 
values  of  duty  cycle  (or  pulse  compression  ratio)  and  BPRI  where  the  circular  convolution  implemen¬ 
tation  is  more  numerically  efficient  than  the  fast  linear  convolution. 

Finally,  we  note  that  there  is  a  settling  time  of  one  extra  PRI  associated  with  using  the  circular 
convolutional  matched  filter  as  compared  to  using  linear  convolution.  This  is  because  the  circular 
convolutional  matched  filter  in  most  cases  uses  data  from  adjacent  PRI.  Hence,  initially  upon  recep¬ 
tion  for  a  target  at  a  given  range,  there  is  target  data  in  one  PRI  but  not  in  the  preceding  PRI  which 
leads  to  an  incomplete  circular  match.  Furthermore,  for  finite  length  pulse  trains,  the  last  PRI  will 
not  be  properly  matched  because  the  succeeding  PRI  has  no  target  data.  Also  note  that  the  absolute 
settling  time  of  any  matched  receiver  depends  on  the  PRI  (the  possibility  of  second,  third,  etc.  time 
around  returns)  and  whether  there  is  clutter  processing. 
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Fig.  1 1  —  Duty  cycle  vs  BPRI 


Fig.  12  —  Pulse  compression  ratio 
vs  BPRI 
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